Acceleration of the Boundary Element Library BEM4I on the Knights Corner and Knights Landing Architectures
نویسندگان
چکیده
The aim of the poster is to present the acceleration of the boundary element method (BEM) by the Intel Xeon Phi technology. The poster provides brief overview of BEM followed by the discretization approach and efficient numerical assembly of the BEM matrices. We discuss its parallelization by OpenMP in shared memory and the SIMD vectorization necessary to exploit the full potential of the Xeon and Xeon Phi architectures. We present numerical experiments performed both on the Knights Corner (KNC) coprocessor and the Knights Landing (KNL) standalone processor.
منابع مشابه
Co-design of a Particle-in-Cell Plasma Simulation Code for Intel Xeon Phi: A First Look at Knights Landing
Three dimensional particle-in-cell laser-plasma simulation is an important area of computational physics. Solving state-of-the-art problems requires large-scale simulation on a supercomputer using specialized codes. A growing demand in computational resources inspires research in improving efficiency and co-design for supercomputers based on manycore architectures. This paper presents first per...
متن کاملDD-αAMG on QPACE 3
We describe our experience porting the Regensburg implementation of the DD-αAMG solver from QPACE 2 to QPACE 3. We first review how the code was ported from the first generation Intel Xeon Phi processor (Knights Corner) to its successor (Knights Landing). We then describe the modifications in the communication library necessitated by the switch from InfiniBand to Omni-Path. Finally, we present ...
متن کاملDD-$\alpha$AMG on QPACE 3
We describe our experience porting the Regensburg implementation of the DD-αAMG solver from QPACE 2 to QPACE 3. We first review how the code was ported from the first generation Intel Xeon Phi processor (Knights Corner) to its successor (Knights Landing). We then describe the modifications in the communication library necessitated by the switch from InfiniBand to Omni-Path. Finally, we present ...
متن کاملLandau Collision Integral Solver with Adaptive Mesh Refinement on Emerging Architectures
The Landau collision integral is an accurate model for the small-angle dominated Coulomb collisions in fusion plasmas. We investigate a high order accurate, fully conservative, finite element discretization of the nonlinear multi-species Landau integral with adaptive mesh refinement using the PETSc library (www.mcs.anl.gov/petsc). We develop algorithms and techniques to efficiently utilize emer...
متن کاملAccelerating gravitational microlensing simulations using the Xeon Phi coprocessor
Recently Graphics Processing Units (GPUs) have been used to speed up very CPU-intensive gravitational microlensing simulations. In this work, we use the Xeon Phi coprocessor to accelerate such simulations and compare its performance on a microlensing code with that of NVIDIA’s GPUs. For the selected set of parameters evaluated in our experiment, we find that the speedup by Intel’s Knights Corne...
متن کامل